Subdomain Entry Vocabulary Modules Evaluation
نویسنده
چکیده
Subdomain entry vocabulary modules represent a way to provide a more specialized retrieval vocabulary in a particular subject area. Several subdomain indexes have been derived for an analysis using the INSPEC database. The results show that subdomain indexes differ significantly from each other and from the general-purpose index they were derived from. The document pools that could be retrieved using the different subdomain entry vocabulary modules also differ greatly. If a word can be understood in more than one sense (polysemy), it is more likely to lead to different output from the individual subdomain indexes. Evaluation of the prediction power of subdomain Entry Vocabulary Modules shows that more specific Entry Vocabulary Modules are more precise in predicting correct subject headings for given documents in a subject area. Related papers and reports: Michael K. Buckland, Aitao Chen, Michael Gebbie, Youngin Kim, & Barbara Norgard. Variation by Subdomain in Indexes to Knowledge Organization Systems. http://www.sims.berkeley.edu/research/metadata/iskopaper.html Youngin Kim. Evaluation of the Sensitivity of Subdomain in EVM dictionary approach Technical Report, 2000. http://www.sims.berkeley.edu/research/metadata/papers/subdomain00.html Youngin Kim. Evaluation of the performance of the EVM dictionaries. Technical Report, 2000. http://www.sims.berkeley.edu/research/metadata/eval_desc.htm Vivien Petras. Variation on Subdomain Indexes Technical Report, 2000. http://www.sims.berkeley.edu/research/metadata/papers/subdvariation.html
منابع مشابه
GIRT and the Use of Subject Metadata for Retrieval
The use of domain-specific metadata (subject keywords) is tested for monolingual and bilingual retrieval on the GIRT social science collection. A new technique, Entry Vocabulary Modules, which adds subject keywords selected from the controlled vocabulary to the query, has been tested. As in previous years, we compare our techniques of thesaurus matching and Entry Vocabulary Modules to simple ma...
متن کاملAdvanced Search Technologies for Unfamiliar Metadata
Searching of databases (textual or numeric) is likely to be effective and efficient only if the user is familiar with the classification, categorizing, and indexing schemes (metadata vocabularies) being searched. Therefore, it is obviously beneficial to provide a bridge between the user’s ordinary language and the metadata vocabularies of the unfamiliar database in order to compensate for abbre...
متن کاملExtracting Modules from Ontologies: Theory and Practice
The ability to extract meaningful fragments from an ontology is essential for ontology re-use. We propose a definition of a module that guarantees to completely capture the meaning of a given set of terms, i.e., to include all axioms relevant to the meaning of these terms, and study the problem of extracting minimally sized modules. We show that the problem of determining whether a subset of an...
متن کاملSemantic Processing of out - of - Vocabulary Words in Aspoken Dialogue
One of the most important causes of failure in spoken dialogue systems is usually neglected: the problem of words that are not covered by the system's vocabulary (out-of-vocabulary or OOV words). In this paper a methodology is described for the detection, classii-cation and processing of OOV words in an automatic train timetable information system 2]. The various extensions that had to be eeect...
متن کاملLarge Vocabulary Continuous Speech Recognition: Improvements in Acoustic Modelling and Search
This paper describes the main improvements we made in two of the basic modules in our HMMbased large vocabulary speaker independent continuous speech recognition system: namely in the acoustic modelling and in the search engine. For the acoustic modelling, we paid special attention both to improved parameter tying at the density and at the state level, and to fast evaluation of the HMMs. For th...
متن کامل